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ABSTRACT 

This is the first of a pair of papers which address the problem of measuring the unred- 
shifted power spectrum in optimal fashion from a survey of galaxies, with arbitrary 
geometry, for Gaussian or non-Gaussian fluctuations, in real or redshift space. In this 
first paper, that pair weighting is derived which formally minimizes the expected vari¬ 
ance of the unredshifted power spectrum windowed over some arbitrary kernel. The 
inverse of the covariance matrix of minimum variance estimators of windowed power 
spectra is the Fisher information matrix, which plays a central role in establishing 
optimal estimators. Actually computing the minimum variance pair window and the 
Fisher matrix in a real survey still presents a formidable numerical problem, so here 
a perturbation series solution is developed. The properties of the Fisher matrix eval¬ 
uated according to the approximate method suggested here are investigated in more 
detail in the second paper. 
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1 INTRODUCTION 

The problem of measuring the power spectrum of fluctua¬ 
tions in the Universe in optimal fashion has received some 
attention recently (Vogeley & Szalay 1996; Tegmark, Taylor 
& Heavens 1997; and references therein). Vogeley & Sza¬ 
lay argue that the Karhunen-Loeve (KL) basis, the basis 
of signal-to-noise eigenmodes of the survey covariance, pro¬ 
vides the optimal basis for measuring the power spectrum. 
Tegmark et al. give an elegant review of what constitutes op¬ 
timal measurement of parameters, and discuss how to com¬ 
press (reweight and truncate) a KL basis to tractable size 
when dealing with large data sets such as the forthcoming 
2df survey, or the Sloan Digital Sky Survey. 

Tegmark et al. emphasize the central role of the Fisher 
information matrix, which is the matrix of second deriva¬ 
tives of minus the log-likelihood function with respect to 
the parameters, and whose inverse gives an estimate of the 
covariance matrix of the parameters. Tegmark et al. con¬ 
clude, in the final sentence of their §4, that “In general, it 
does not seem tractable to devise a data-weighting scheme 
which optimizes the Fisher matrix directly”. A principal aim 
of the present paper is to demonstrate that the problem is 
after all tractable, albeit in an approximation, for the case 
where the parameter set to be measured is the unredshifted 
power spectrum of galaxies. The paper which follows this 
one (Hamilton 1997, hereafter Paper 2) investigates in more 
detail the properties of the approximate Fisher matrix com¬ 


puted according to the procedure described in the present 
paper. 

I suppose that one has at hand a galaxy survey, in real 
or redshift space, in which the observed galaxy density at 
position r is N{r), and the selection function, the expected 
density of galaxies at r, is a function $(r) which is known or 
can be measured with negligible uncertainty. The observed 
overdensity 5{r) at r is then defined to be 


^ N{r)-<f>{r) 


( 1 ) 


The overdensity S here is taken to be defined in real (unred¬ 
shifted) space, but it is shown in ^ how the results gener¬ 
alize to the case of linear distortions in redshift space. 

The basic problem considered in this paper is to esti¬ 
mate the true unredshifted galaxy covariance function ^ = 
{SS) from the data in real or redshift space. The true covari¬ 
ance ^ of galaxies in the Universe differs from the observed 
covariance because of noise and because of finiteness of the 
sample. In the real representation the true unredshifted co- 
variance function is the correlation function ^(r), which on 
the assumption of statistical homogeneity and isotropy is a 
function of scalar separation r, while in the Fourier repre¬ 
sentation it is the power spectrum ^(fc), a function of scalar 
wavenumber k 


^(fc) = J e'^ ”^(r)dV = J jo{kr)^{r) Anr^dr (2) 
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where jo{x) = sin(a:)/a: is a spherical Bessel function. I ad¬ 
here thronghont this paper to what has become the stan¬ 
dard convention in cosmology for defining Fourier trans¬ 
forms, notwithstanding the extraneous factors of 27r which 
result. 

I start by considering, in the power spectrum ^(k) 
windowed over some arbitrary prescribed kernel G{k), 


i= G(fc)C(fc)47rfc^dfc/(27r)® 


( 3 ) 


The windowed power spectrum ^ can be estimated by an 
estimator ^ (the hat distinguishing the estimator from the 
true value which is quadratic in observed overdensities <5: 


C = / W{ri,rj)6{ri)S{rj)d\id\j 


( 4 ) 


Section 1^ derives an expression, equation (ps|), for that pair 
weighting of overdensities SiSj which formally mini¬ 

mizes the expected variance amongst quadratic estimators 
^ of the windowed power spectrum 

The inverse of the expected covariance (A^qA^/?) 
between minimum variance estimators of the power spec¬ 
trum is the Fisher information matrix for the power spec¬ 
trum, as discussed in In effect, the Fisher matrix defines 
the maximum amount of information that can be extracted 
about the power spectrum from a given survey, and is there¬ 
fore a fundamental quantity in designing optimal procedures 
for measuring the power spectrum. Paper 2 shows how the 
Fisher matrix can be used to construct a complete set of 
positive kernels yielding a complete statistically orthogonal 
set of windowed power spectra. 

Actually evaluating the Fisher matrix still presents 
a formidable problem, involving inversion of the covariance 
(ACijACki) of the survey covariance (eq. Q below), which 
is a rank 4 matrix of 3-dimensional quantities. A solution 
to the problem is proposed in and |^. In §P, it is shown 
that for Gaussian fluctuations, in the classical limit where 
the wavelength is short compared to the scale of the survey, 
the minimum variance pair window goes over to that derived 
by Feldman, Kaiser & Peacock (1994, hereafter FKP). The 
problem of generalizing the FKP pair window to the non- 
Gaussian case is addressed in but unfortunately I have 
been unable to find an elegant way to implement such a gen¬ 
eralization. In §^, a perturbation series solution of the Fisher 
matrix is developed, starting from the classical FKP solution 
in the Gaussian case. A pert urbation solution exists also in 
the non-Gaussian case, §6.2, but the lack of a simple non- 


Gaussian generalization of the classical FKP window makes 
the choice of zeroth order solution here less clear. Paper 2 
evaluates the Fisher matrix assuming Gaussian fluctuations 
and the zeroth order FKP approximation. 

Section ^ offers advice on implementing an approximate 
minimum variance pair weighting. Section @ summarizes the 
conclusions. 


2 MINIMUM VARIANCE WEIGHTING 
2.1 Prior 

To derive minimum variance measures, it is necessary to 
make prior assumptions about the origin of the variance in 


a survey. Such prior assumptions can be regarded as pri¬ 
ors in Bayesian model testing, or alternatively as reasonable 
guesses which in practice may yield near minimum variance 
estimates. 

In the first place, I assume that the expectation value 
of the (unredshifted) survey covariance is the sum of the 
cosmic covariance ^ and a Poisson sampling term 

C{ri, rj) = {5{ri)5{rj)) = ^{nj) -b SD{rij)^{ri)~^ (5) 

where nj = \rij \ and rij = Vi — Vj, and So denotes a Dirac- 
delta function. This assumption implies in particular that 
the (unredshifted) survey covariance C(ri,rj) at any pair 
of non-coincident points Ti ^ rj provides an estimate of the 
cosmic covariance 

C(r’i, r’j)ri7^rj = ^{fij) ■ (6) 

Secondly, I suppose that the expected covari¬ 
ance {ACijACki) of the survey covariance function Cij is 
a specified function. Under the same assumption that it is a 
sum of cosmic and Poisson sampling terms, the covariance 
{ACijACki) takes the general form 

{ACijACki) ~ CikCji 4 - CilCjk kiijkl ( 7 ) 

where Hiju is the expected 4-point correlation function of 
the survey, which is a sum of the cosmic 4-point function 
Vijki plus Poisson sampling terms where any two or more of 
the points ijkl coincide, 

Ti-ijU = Vijki + [(5D(rij)Cifci'l'“^ -b eye.] (6 terms) 

-b [SD{rij)5D{rki)^ik^i -b eye.] (3 terms) 

-b [SDirij)S D{rik)^ii^i ^ -b eye.] (4 terms) 

+ 5D{rij)5D{rik)6D{rii)^~^{l term) (8) 

with C,ijk the 3-point function. In estimating the cosmic co- 
variance ^{vij) from the survey covariance C{ri,rj), it is 
necessary to omit (or subtract off) the Poisson noise term 
where i = j, as in equation (|^. The covariance of the survey 
covariance with these Poisson terms removed is given by 

{ACijACki) = CikCji + CilCjk + Hijki (9) 

k^l 

where the 4-point function Hijki includes only those Poission 
sampling terms where i or j = k or I, not those where i = j 
or k — I, 

Hijki = rjijki + [5D{rik)Ciji^i^ + {i j,k ^ O] (4 terms) 
+ [SD{rik)SD{rji)^ij^~^^J^ + {k ^ Z)] (2 terms) . 

( 10 ) 

In the particular case of Gaussian fluctuations, the 4- 
point function Hijki (and likewise Hijki) vanishes, in which 
case the covariance of the survey covariance reduces to 

{ACijACki) = CikCji + CilCjk ■ (11) 

Although the results of this paper are not confined to the 
Gaussian case, the combination of poor knowledge of the 
4-point function, the additional complication of the non- 
Gaussian case, and the expectation that fluctuations may 
well be Gaussian on linear scales, means that the Gaussian 
hypothesis is a natural first choice for model testing, at least 
on sufficiently large scales. 
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2.2 Notation 

Henceforward, it is convenient to adopt an abbreviated no¬ 
tation in which Latin indices ij denote pairs of volume ele¬ 
ments at positions Vi and rj in a survey, while Greek indices 
a denote pair separations r^- Introduce the convention that 
replacing a pair index ij on a vector or matrix by a separa¬ 
tion index a means sum over all pairs ij separated by Va in 
the survey. Thus, in the real space representation. 


= / W'Ooilri — rj\ — rP) d^rid^r^ 


( 12 ) 


and similarly for matrices with more indices. Normalizing 
the Dirac delta-function in equation ( 0 ) on a 3-dimensional 
scale so that f Soirij — ra)47rr^drti = f Soirij — To) 
iTTrfjdrij = 1 ensures the usual interpretation ^ for ex¬ 
ample, = / G(r)^(r)dnPdr (see equation in the 

real space representation. 

To exhibit the derivation of the minimum variance pair 
weighting in as general a form as possible, it is useful to bor¬ 
row ideas from quantum mechanics, and to treat all quanti¬ 
ties, such as the covariance function ^a, or the pair weight¬ 
ing as vectors in a Hilbert space. Such vectors have a 
meaning independent of the particular basis with respect to 
which they might be expressed. For example, the covariance 
function is the correlation function ^(r) in the real space rep¬ 
resentation, or the power spectrum ^{k) in the Fourier repre¬ 
sentation, but from a Hilbert space point of view these quan¬ 
tities are the same vector, and in this paper I use the same 
symbol ^ to denote them both. (It would be nice to adopt 
the Dirac bra-ket notation, but unfortunately this leads to 
confusion with ensemble averages.) 

To be explicit, suppose that ipdr) constitute some ar¬ 
bitrary complete set of orthonormal separation functions, 
labelled by separation index a. Then the covariance func¬ 
tion in the -i/i-representation is related to ^(r) in the real 
space representation by 


Co = y tpa{r)^{r)d-Kr^dr , 
C(r) = . 


(13) 

(14) 


The summation convention for repeated indices is adopted 
in 0) and hereafter. The raised index on ?/)“ denotes the 
Hermitian conjugate of ija ; one of a pair of repeated indices 
is always raised, the other lowered. The orthonormality con¬ 
ditions on ija are 


/ 


drrr^dr = Soap 


(15) 


where Sdc/s denotes the unit matrix in ^-space (the sub¬ 
script D is retained to distinguish the unit matrix So from 
the overdensity 5; for discrete representations. So should be 
interpreted as a Kronecker delta rather than a Dirac delta- 
function); and 


{ri)ipc,{r2) = SD(ri2) ■ 


(16) 


Similarly, suppose that (jij (ri, r 2 ) constitute some arbitrary 
complete orthonormal set of pair functions, which by pair 
exchange symmetry may be taken to be symmetric in i <-> j 
and ri « r 2 without loss of generality. Then a pair function 
W’’^ in the (()-representation is related to W{ri,r 2 ) in the 
real space representation by 


= J (l)'Pri,r2)W(ri,r2) d\id\2 

W{ri,r2) = (j)ij[ri,r2) ■ 


(17) 

(18) 


Define the symbol Sym to symmetrize over its underscripts, 
as in 


Sym Cij = (Cij +Cji)/2 . 
(ij) 


(19) 


The orthonormality conditions on the pair functions (l)ij are 
r 

/ <)>ij(ri,r2)()>fci(^ii^2)dVidV2 = SymSoikSoji (20) 

J (kl) 

where Syrnf’i^i-^SoiODji is the unit matrix in ((>-space, and 


(j)'Pri,r2)(j>ii{r3,r4) = Sym5D{riO^D(r24,) 
( 34 ) 


( 21 ) 


where again Sym(34j(5_D(ri3)5£)(r24) is the unit matrix for 
pairs in the real representation. 

In the real space representation, the basis of or¬ 
thonormal separation functions is the set of delta-functions 
in real space, tpair) = Soir—ra), and summation over 
Q signihes integration over dirr^dra. The basis of pair 
functions in the real representation is similarly the set 
of pairs of delta-functions in real space, (jijOijf'O = 
Sym(jjj(5_D(ri—ri)(5_D(r2—rj), and summation over ij sig¬ 
nihes integration over dVidVj. In the real space representa¬ 
tion, the Hermitian conjugate of a real-valued function is of 
course itself. 

In the Fourier representation, the orthonormal sep¬ 
aration functions are ipa{r) = jo{kar) (compare 
eq. |0), and summation over a signihes integration over 
i-Kk^ka /. The orthonormal pair functions are expo¬ 
nentials 4 >ij{r\,r 2 ) = , and summation 

over ij signihes integration over d\id?kj/O' kY'. In Fourier 
space, raising an index i, which means take the Hermi¬ 
tian conjugate with respect to ki, is equivalent to replacing 
ki —ki, for functions which are real-valued in their real 
space representation, as is true for all functions considered 
in this paper. 

Let Dfj denote the delta-function SoHvi — rj \ — rP) 
in a general representation. Explicitly, with respect to arbi¬ 
trary orthonormal bases ipa{r) of separation functions and 
of pair functions, the matrix D“ is 


= / fo(ri2 - r)i/'“(r)(^ij(ri, r2) dvrr^dr dVidV2 


= / (I>ij{ri,r2)ii°‘{ri2) d\id\2 ■ 


( 22 ) 


Then in general the convention (|^ of replacing a pair index 
ij with a separation index a is equivalent to contracting with 
the matrix D“- 




(23) 


As already indicated above, in the real space representation 


is 


= Sndri -rp- rP) ■ 

In the Fourier representation is 
D“ = (27r)®fo(fci -f kO&D{ki -kO . 


(24) 

(25) 
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The second delta-function in equation (^ ) is normalized so 
f Soiki — ka)iT^k^dka = 1. Thus in Fourier space, if = 
W{—ki,—kj), then W°‘ = J W{—ka'n,kaTi)do/{iTT), where 
the integral over solid angle do is over all unit directions n. 

2.3 Minimum variance pair weighting 

Consider the problem of measuring the power spectrum 
windowed over some arbitrary kernel function G“ 

i = . (26) 

Equation is equation (|^ in abbreviated, representation- 
independent form. The windowed power spectrum ^ can be 
estimated by an estimator ^ which is a weighted sum over 
pairs SiSj of overdensities in the survey 

I = W*^Si5j (27) 

where W’’^ is any a priori pair window constrained to satisfy 

= G“ (28) 

and which vanishes along the diagonal in real space 

W{r,r) = 0 . (29) 

Equation (^^ is equation (^ in abbreviated, representation- 
independent form. In equation (|2^ , is the 

representation-independent symbol for integrated over 
all pairs ij at se par ation a in the survey, in accordance with 
the convention (|23|). The condition that the window VF*-’ be 
chosen a priori, independently of knowledge of the densities 
Si (as is done below), ensures that the window will be un¬ 
correlated with the density, and therefore that the estimator 
^ will be unbiassed. 

The estimate |, equation (^^, of ^ is valid because it 
is being assumed, equation (^, that the expectation value 
{5{ri)5{rj)) of overdensities in real space at points ^ rj is 
equal to the cosmic variance ^{va) at separation \ri — rj \ = 
Ta. The condition ( p9| ) that the pair window W{ri,rj) in 
real space vanishes at = rj ensures that the Poisson noise 
term in the survey covariance at rt = rj is excluded. Since 
the estimator ( p^ ) is valid in the real space representation, 
it follows immediately that it is valid in any representation. 

Imposing the condition ( p^ that the pair window van¬ 
ishes along the diagonal in real space, VF (r, r) = 0, is equiva¬ 
lent to excluding from the computation of W^^SiSj self-pairs 
of galaxies, in which the pair consists of a galaxy and it¬ 
self. Alternatively, it may be computationally convenient to 
include the Poisson contribution to W''^Si5j, by including 
self-pairs of galaxies, and to subtract the self-pair contri¬ 
bution as a subsequent step. Such a subtraction is always 
exact. 

The minimum variance pair weighting is that which 
minimizes the expected variance of the estimator (p?!), 

(Af) = W'^W’^‘{ACijACki) , (30) 

subject to the constraints ( p8[ ) and (p^. The condition ( p^ 
on the pair window kF*^ can be discarded provided that the 
covariance (ACijACki) used in equation ( po[ ) is taken to 
be the covariance (|^, in which the terms with r; = rj or 
Tk = ri present in the full covariance (Q) are excluded. The 
constraints ( p^ can be imposed by introducing Lagrange 
multipliers Xa, and by minimizing the function 


IT'"'lF'='(AG„AGfci) - 2Ac(IF“ - G“) (31) 

with respect to the weights VF*^ and the multipliers Aa- 
Setting the partial derivatives of the function ( pl| ) with re¬ 
spect to the multipliers Aa equal to zero recovers the con¬ 
straints (^^, while setting the partial derivatives with re¬ 
spect to the weights equal to zero implies 

VF'^' (ACijACki) -XM=0 . (32) 

Equation ( |3^ implies 

]¥":> = r‘"'“A^ (33) 

where jg defined to be the inverse of the covariance 

matrix (ACijACki), 

rpijki _ ^agAG)"^*^''"' (34) 

[so that (AGi, AG^„)r™"''* = Symt^kiAoi Soj] and 

is the integral of this matrix over all pairs kl separated by a, 
in accordance with the convention (p3|). For Gaussian fluctu¬ 
ations, equation (^^, the inverse T*-’*^* takes the simplified 
form 

Tijki ^ 1 , (36) 

2 (kl) 

but the analysis here is not restricted to the Gaussian case. 
The Lagrange multipliers Aa can be eliminated by sum¬ 
ming equation (p^ ) over pairs ij at fixed separation (5 (that 
is, by applying the operator 7?^), and imposing the con¬ 
straints (p8|): 

which shows Aa = T-^G^. Here again signifies 
integrated over all pairs ij separated by a, and all pairs kl 
separated by /3, according to the convention (p^): 

, (38) 

and T~^ denotes the inverse of Thus the minimum 

variance pair weighting (p^) is 

^ (3g) 

Equation (p^ gives that pair weighting kF'^ which min¬ 
imizes the expected variance of the power spectrum win¬ 
dowed over some arbitrary kernel G, equation (p^, amongst 
estimators ( |27| ) quadratic in the observed overdensities 5. 

3 FISHER MATRIX 

Consider the expected covariance between estimates ^ and 
5 of power spectra ^ = G“^a and = C'°‘^a windowed 
through kernels G“ and G'“. If both estimates ^ and G are 
made using minimum variance weightings kF*^ and kF'*-’, 
then a short calculation from equation (pffl shows that the 
expected covariance (A^AC) of the estimates is 

(AiAi') = ACijACki) = TZpC°‘C'^ . (40) 

Equation (^^ shows that the expected covariance matrix 
(A^aA^/j) between minimum variance estimates ^a and 
of power spectra is 

(ACaAC/j) = Tj . (41) 
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The important matrix given by equation (|3^), can be 
recognized as the Fisher information matrix (Tegmark et 
al. 1997, §2) for the case where the parameter set to be 
measured is the power spectrum, and the onl y s ource of 
noise is Poisson sampling noise, as set forth in § p.l| . 

Strictly, the Fisher matrix is defined as the matrix of 
second derivatives of minus the log-likelihood function with 
respect to the parameters. However, the central limit theo¬ 
rem ensures that estimates of quantities become Gaussianly 
distributed about their true values in the limit of a suffi¬ 
ciently large survey, so that the log-likelihood function is 
quadratic about its maximum. Correctly, it is only in this 
limit of a large survey that the Fisher matrix equals the in¬ 
verse of the covariance matrix. Here I tacitly assume that 
the survey at hand is large enough that the central limit 
theorem applies. It is to be noted that the statement of the 
central limit theorem, that an estimate is asymptotically 
Gaussianly distributed about its true value, is entirely dis¬ 
tinct from the question of whether or not the density fluctu¬ 
ations themselves form a multivariate Gaussian distribution. 
The central limit theorem applies to non-Gaussian as well 
as Gaussian density fields. 

Some of the power of Fisher matrix T°‘^ will become 
apparent in Paper 2. 


4 REDSHIFT DISTORTIONS 


The analysis hitherto addressed the problem of determin¬ 
ing the power spectrum from unredshifted data. In redshift 
space however, peculiar velocities of galaxies along the fine of 
sight distort the pattern of clustering. In this section I show 
how the results obtained so far extend to the case of measur¬ 
ing the unredshifted power spectrum from redshift data, at 
least for redshift distortions in the linear regime. I show that 
the minimum variance pair window differs from the unred¬ 
shifted case, equation (|48[), but that the Fisher matrix of the 
unredshifted power spectrum remains unchanged. The min¬ 
imum variance window depends on the linear growth rate 
parameter (3 ~ which is taken here to be a prior pa¬ 

rameter. I plan to address the issue of measuring (3 itself in 
a subsequent paper. 

Let a superscript (s) denote quantities measured in red¬ 
shift space. For fluctuations in the linear regime, the over¬ 
density in redshift space is linearly related to the over¬ 
density 5 in real space by 

= SS (42) 


where S is the linear redshift distortion operator, which in 
the real space representation is (Hamilton & Culhane 1996, 
equation [12]) 


s = i + p 




a{r)d 


dr2 j-Qj. 


(43) 


with /3 ~ /h the linear growth rate parameter, and a(r) 
the logarithmic slope of P times the selection function ‘I>(r) 
at depth r 


. . d\aP^(r) 

- 

a In r 


(44) 


Consider a weighted sum of products of overdensities in 
redshift space According to the relation (E^ 


between 5^"^ and 5, this weighted sum is (here Sp is the 
redshift distortion operator (M^ in a general representation) 

^ = W^^^SkSi (45) 

where the last equation can be regarded as defining a real 
space pair window LF'-’ equivalent to the redshift space pair 
window Equation (^^ shows that the real and red¬ 

shift space windows are related by 

where is the Hermitian conjugate of the distortion oper¬ 
ator S, which in the real space representation is 




1 -f/3VV"^ 


dr 




(47) 


Equation (^^ inverts to give the redshift space pair window 
in terms of its unredshifted counterpart IF*-’ 




(48) 


where 5"^“^ is the inverse, i.e. the Green’s function, of the 
Hermitian conjugate of the distortion operator S. This in¬ 
verse is known explicitly in cases where S can be diagonal¬ 
ized, which include the plane-parallel approximation (Kaiser 
1987), where S is diagonalized in the Fourier representation, 
and cases where a{r), equation (^), is a constant, for which 
S is diagonalized in the representation of logarithmic spher¬ 
ical waves (Hamilton & Culhane 1996, eq. [50]; note that 
the Hermitian conjugate of rj, the eigenvalue of —d/d\nr, 
in this equation is — 3 — tj; see also Tegmark & Bromley 
1995, who give an expression for the Green’s function S~^ 
for the particular case a{r) = 2). 

The minimum variance pair window IF*’ in real space 
was derived in ^ equation (^ ) . The minimum variance pair 
window IF^®^*-’ in redshift space is then given in terms of the 
minimum variance window IF*’ by equation (^). This is 
true because minimizing sO is equivalent to min¬ 

imizing IF*’(5i5j, the two expressions being equal by defini¬ 
tion (pbj). 

Although the minimum variance pair window differs 
between real and redshift space, the Fisher matrix of the 
unredshifted power spectrum is itself unchanged. This is 
because it is being assumed that the quantity being esti¬ 
mated is the unredshifted power spectrum, whose expected 
covariance matrix (A^c«A^^) depends on the survey geome¬ 
try and the cosmic covariance, but is independent of whether 
the data lie in real or redshift space. A pertinent considera¬ 
tion here is that, as emphasized by Fisher, Scharf & Lahav 
(1994), the selection function 'l>(r) in a flux-limited survey is 
a function of the true distance r to a galaxy, not the redshift 
distance. 

The same conclusion, that the Fisher matrix of the 
unredshifted power spectrum is the same whether the data 
fie in real or redshift space, is found if the analysis in is 
carried through in redshift space. The redshift space versions 
of relevant quantities are as follows. The covariance matrix 
(AC^P AC^f) of the survey covariance matrix in redshift 
space is related to its unredshifted counterpart (ACijACki) 
by 

(AClfACif) = srsps,^S^PACmr.AC^,) . (49) 


Similarly the inverse pdi-iki redshift covariance ma- 
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trix is related to its unredshifted counterpart by 


rp(s)ijkl _ ^^(a)\^-lijkl 

T mnpq q—H q—IJ Q—lk q—11 


(50) 


The relation between the matrix in redshift space and 

its unredshifted counterpart Dfj is determined by the re¬ 
quirement that 

DfeiVF''* = Si'" S/ = (51) 

in which the first equality follows from the relation ( p^ be¬ 
tween the pair windows and , and the second 

equality can be regarded as defining Equation ( |5l| ) 

shows that is related to Dfi by 

Ij tj o 


= D^iSi'"S/ . 

Finally, the Fisher matrix in redshift space is 

rp(s)aj3 _ jj(s)arjn(a)ijkl jj(s)l3 _ rpaf} 


(52) 

(53) 


in which the last equality follows from a short calculation 
from equations (|^ and (p^. Thus the Fisher matrix is 
the same matrix in both real and redshift space, as claimed. 


5 THE CLASSICAL LIMIT 
5.1 Gaussian Fluctuations 

Feldman, Kaiser & Peacock (1994; hereafter FKP) derived 
a minimum variance pair window oc [^(k) + 'h(ri)“^]“^ 
[i{k) + 'l?(rj)~^]“^ for measuring the power spectrum ^{k), 
valid for Gaussian fluctuations in the limit where the se¬ 
lection function varies slowly compared to the wavelength, 
which may be termed the classical limit. In this Section it is 
shown how FKP’s pair window, equation (|62|), emerges from 
the present analysis. The classical solution will be used in 
the next Section, as the starting point of a general per¬ 
turbative expansion for the minimum variance pair window 
and the Fisher matrix. 

The survey covariance matrix Cij, equation (|^, can be 
regarded as an operator which is the sum of two opera¬ 
tors, C = ^ + 4?“^, the first of which, the cosmic covari¬ 
ance (27r)®5£)(fei -I- kj)^{ki), is diagonal in Fourier space, 
and the second of which, the inverse selection function 
5D{ri — rj)^{ri)~^, is diagonal in real space. The two terms 
do not commute in general, but they do commute approx¬ 
imately, C(’'b)^(^j)~^ “ ®(^i)~^^(^b') ~ 0i in i'iis classical 
limit where the wavelength is much shorter than the scale 
over which the selection function varies. In the classical limit, 
wavelength and position can be measured simultaneously, 
and the two terms in the survey covariance Cij can be diag¬ 
onalized simultaneously: 

Cij =5Dijmi) + ^ri)-^] . (54) 

The eigenfunctions of Cij in this representation are wave 
packets localized in both position and wavelength, and Soij 
here should be interpreted as the unit matrix in this rep¬ 
resentation. Equation ( ]5^ ) means that Cij acting on any 
function localized around position ri and wavelength ki is 
equivalent to multiplication by ^{ki) + <E>(ri)“^. The inverse 
survey covariance follows immediately from 
equation (^), and is likewise diagonal when acting on func¬ 
tions localized in position and wavelength 


C-iij 


S% 

e(fci) + 4>(ri)-i ■ 


(55) 


For Gaussian fluctuations, the covariance {ACijACki) 
of the survey covariance is equal to CikCji + CuCjk, equa¬ 
tion © , and is therefore also diagonal in the classical limit, 
in a representation where the eigenfunctions are products of 
pairs of wave-packets localized in position and wavelength: 


{ACijACki) = {SoikSoji + SoiiSojk) 

X [?(fc0+^(»’i)"^]K(fct)+ 'l’(»’i)"^l -(56) 


This expression remains valid even when the wave-packets i 
and j are well-separated in position and/or wavelength: the 
delta-functions in equation (pq ) assert only that wave-packet 
i is the same as wave-packet k (or Z), and that j is the same 
as I (or k). The inverse of the covariance matrix (js^ 

is again diagonal when acting on pair functions localized in 
position and wavelength 

^ijki ^ 5)1,5^^^ 

4[^(fcO + 3>(rO-i][C(fe.0 + 4>(r.')-'] ■ ^ ^ 

The minimum variance pair window (|^ derived in 
^ involves the quantities D'^i and = 

DfjT'^'^'D^i, where is the operator ( p^ which effectively 
integrates over pairs ij at separation a. Now above is 

diagonal provided that it acts on functions which are local¬ 
ized in both position and wavelength. This can be accom¬ 
plished by choosing in a mixed representation with ij 
in real space and a in Fourier space, in which case Dfj is a 
spherical Bessel function 


Dij = 3o{karij) (58) 

with ru = \ri — rj\. Contracting the matrix in equa¬ 

tion (^^ with D'^i from equation (js^ ) replaces the Dirac 
delta-functions in the numerator of with and 

sets ki = kj = fcn. the latter being evident from the Fourier 
representation (|25| ) of Dfj. The resulting matrix is, in 
the mixed representation, 

rfija _ _ joikaTij) _ 

2[C(fc.) + 4>(rO-i]K(fe«) + 4>(r,)-i] ’ 

This is, up to a normalization factor, the FKP pair window. 
Operating again on in equation ( |59| ) with Dfj from 
equation (^) yields the Fisher matrix r“^ in the Fourier 
representation 


n jo{kcrij)jo{k0rij) d^nd^j 

In the same classical limit that the selection function 4? 
varies slowly over many wavelengths, the Fisher matrix 
is diagonal in Fourier space 

T- = - I,) J . («) 

From equations ( |39| ) and ( p^ it follows that the min¬ 
imum variance pair window for measuring the power 
spectrum ^{ka) at wavenumber ka in the classical limit is, 
in real space, 


W{ri,rj) 


_ joikaTij) _ 

2[C(fc„) + 4>(rO-i][^(fe.) + ‘I>(r,')-i] 


(62) 
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where a% is the reciprocal of the eigenvalue of the Fisher ma¬ 
trix T°‘^, this eigenvalue being the coefficient of the identity 
matrix (27r)®(5_D(fca — kp) in equation (|^, 

dV 

2[e(fcc)+4>(r)-i]2 

Equation gives the pair window correctly normal¬ 
ized to yield an estimate ^(ka) of the power spectrum at 
wavenumber ka 

i(ka) = j W{ri,rj)S{ri)S{rj)d\iA\j . (64) 

Equations (^) and are precisely FKP’s estimator of 
the power spectrum. 

The expected covariance (A^(fca)A^(fc^)) between es¬ 
timates (|^) of the power spectrum is, according to equa¬ 
tion (^), given by the inverse of the Fisher matrix T°‘^ in 
equation (|6l|), 

{A^{ka)^iikp)) = {2'K)^5D{kc, — kp) . (65) 

The delta-function here means that the variance diverges 
for ka = kp. To resolve the divergence, it is necessary to 
average estimates ^(ka) of the power spectrum over shells 
of finite thickness in fc-space. Of course the divergence is 
only an artefact of the classical limit: in reality the delta- 
function in equation (|^ ) has a finite coherence width of the 
order of the inverse scale of the survey. Thus it is necessary 
to average over shells broad enough to ensure that estimates 
of the power spectrum in neighbouring shells are effectively 
independent. So let ^{ka) denote the estimated power spec¬ 
trum (^) averaged over a shell of volume 14 about ka which 
is thin (hopefully) compared to the scale over which ^{k) 
varies, yet broad compared to a coherence length 

^{ka) = Vk~^ j dl4 (66) 

where dl4 = 4:'Kkadka/{2-K)^. FKP call 14 the coherence 
volume. The variance of the shell-averaged power spectrum 
^{ka) is then 

(A^ikaf) = al/Vt (67) 

which shows that the variance decreases inversely with the 
coherence volume 14, as expected for the variance of aver¬ 
ages of independent quantities. 



(63) 


5.2 Non-Gaussian fluctuations 

Unfortunately, the FKP approach does not generalize in an 
elegant way to the case of non-Gaussian fluctuations. The 
fundamental difficulty is that, whereas in the Gaussian case 
the cosmic and Poisson sampling terms in the survey covari¬ 
ance commuted in the classical limit of slowingly varying 
selection function $, in the non-Gaussian case the terms no 
longer commute. That is, there are three sets of terms in 
the survey covariance {ACijACki), equation (^), one inde¬ 
pendent of the selection function <E> (the cosmic term), one 
proportional to and one proportional to The co¬ 
efficients of these terms do not generally commute, for non- 
Gaussian fluctuations. If the three coefficients did commute, 
then it would be possible to find simultaneous eigenfunctions 
of the terms, from which localized ‘pair-wave-packets’ could 


be constructed, and it would be possible to proceed in the 
much the same way as the Gaussian case of the previous sec¬ 
tion, §5.1. It turns out that a scalar separation variable can 
always be defined in the space of such eigenfunctions, anal¬ 
ogous to the scalar wavenumber k in the Gaussian case, and 
the Fisher matrix would then be diagonal, in the classical 
limit, with respect to this scalar separation variable, in the 
same way that the Fisher matrix ( |6l| ) is diagonal in Fourier 
space, for Gaussian fluctuations in the classical limit. 

As it is, it appears that the best that can be done, in 
the classical limit of slowly varying selection function <1>, 
is to diagonalize the survey covariance {ACijACki} locally, 
which certainly is feasible. The disadvantage of this is that 
the eigenfunctions depend on the local value of the selection 
function 4>, and in particular the scalar separation variable 
associated with the eigenfunctions has a different meaning 
depending on the local value of $. Whilst it may be possible 
to make headway along these lines, here I do not pursue the 
issue further. 


6 PERTURBATIVE SOLUTION 

In this section I show how the matrix and hence 

the minimum variance pair window and the Fisher matrix 
T“^, can be evaluated perturbatively. In the Gaussian case, 
t he c lassical solution (in effect, the FKP approximation) of 
^.l| provides a natural zeroth order approximation. Indeed, 
in practical applications the zeroth order solution may be 
judged already to be adequate, as in Paper 2. For non- 
Gaussian fluctuations, a perturbation solution can still be 
developed, although the absence of a natural zeroth order 
approximation makes this solution, at least for the time be¬ 
ing, less useful. 


6.1 Gaussian fluctuations 

For Gaussian fluctuations, the inverse of the covariance 

matrix {ACijACki) simplifies to a symmetrized product of 
the inverse C~^ of the survey covariance G, equation (p6[). 
Thus to develop a perturbation series for T, it suffices to 
develop a perturbation series for C~^. 

Suppose that is a zeroth order approximation to 

the inverse of Cij. Multiplying the two matrices together 
yields (with indices dropped for brevity) 

CC-^ = l + e (68) 

where is a supposedly small matrix. The exact inverse 

C~^ is then given by the perturbation series 

1 0 1 1 1 2 , 

C~^ = c-^ +c-^ + c-^ + ■■■ 

= C-\l-e + P -) (69) 

with 

d-^ =C-\-£)" (70) 

provided that the series converges. In practice, it may be 
that the series (^^ converges for some elements of the ma¬ 
trix, but not for others. If e were diagonal for example, then 
the series for any particular diagonal element tp would con¬ 
verge or diverge depending on whether the absolute value of 
that element were less than or more than one. In the actual 
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case, the matrix e is only approximately diagonal, so that 
the convergence of apparently converging elements may be 
asymptotic - the series converges up to a certain point, but 
thereafter diverges, because of gradual mixing in of diver¬ 
gent parts of the matrix. Below, I assume that if the first 
few terms of the series for a particular element of the matrix 
C~^ are appearing to converge, then they are converging to 
the true value of that element. The complete inverse matrix 
C~^ can then be built up by combining pieces from several 
different choices of the initial guess C~^. 


To construct the zeroth order inverse C , start with 
the classical approximation To be specihc, interpret the 
delta-function 5’’^ in the numerator as lying in real space, 
and fix the wavenumber k in the denominator to be some 
constant ko, which will be adjusted later so as to make the 
first and higher order corrections small. Th e choice of ko is 
discussed at the end of this subsection S.l, and further in 
§1^ clearly one will want ultimately to choose ko to be close 
to the particular wavenumber(s) at which one is trying to 
estimate the the power spectrum, or its inverse variance, the 
Fisher matrix. Thus the zeroth order inverse C~A is taken 


to be (note that for functions which are real-valued in real 
space, the value of a quantity with a raised index is the same 
as that with a lowered index, in the real space representa¬ 
tion) 

C~^{ri,rj) = SD{rij)U{ri) (71) 


where U(r) is defined by 
U{r) = -- 

^ ^ “ 1 + akoMr) ’ 


(72) 


which may be regarded as the survey window, weighted in 
a certain way (the FKP way, in fact). 

According to the argument accompanying equa¬ 
tion (|55|), the approximation 0 to the matrix only 
has limited validity, namely it is valid when acting on func¬ 
tions which are sufficiently localized in Fourier space about 
wavenumbers k ~ ko that ^(fc) « ^(feo) (the approxima¬ 
tion (|^) would have general validity only if the power spec¬ 
trum were nearly that of shot noise, ^(fc) = constant). Thus 
it makes sense to pass into Fourier space, and to develop the 
perturbation expansion there. In the Fourier representation, 
the zeroth order inverse C~A is (it is useful to recall that for 

‘■J ' 

functions which are real-valued in real space, as are all the 
functions in this section, raising or lowering an index i in 
Fourier space is equivalent to changing ki —> —fe;; I adhere 
to the convention that a lowered index i signifies while 
a raised index signihes —ki) 

C-Aki,kj)^U{ki + kj) (73) 


where U{k) = J U(r) e‘*' ’'dV is the Fourier transform of 
U(r), the constant ko being held fixed as the Fourier trans¬ 
form is taken. In Fourier space, the survey window U(k) is 
expected to be a compact ball about k =J), of width ~ 1/r 
where r is the depth of the survey. Thus (7,))^ is expected to 
be near diagonal in Fourier space, with only elements i ~ j 
appreciably nonzero. 

Multiplying the survey covariance Cij , equation (|),by 
the approximation (|^) to its inverse, and Fourier transform¬ 
ing, yields the matrix eij dehned by equation ( |^ 

e(fei, kj) = U{ki + kj)[^{ki) - ^(fco)] . (74) 


Again, the expectation that U{k) is concentrated about 
k = 0 means that the matrix €ij should be near diagonal in 
Fourier space, with only elements i ~ j appreciably nonzero. 
Equation (|^) shows that the matrix e is the product of the 
potentially small factor ^(fci) — C(^o) times a factor U which 
is no larger than l/^(fco), equation ([t^. Thus it is expected 
that the series ( |^ in e should converge for wavenumbers k 
satisfying 


C(fc)-^(fco) 

C(feo) 


^ 1 . 


(75) 


The higher order perturbations C follow from equa¬ 
tion ( [to] ) with C~^ given by (^) and e given by (Q. The 
hrst order perturbation CH is 

C-\k,,k,) = -J U{k.-k)U{kj+k)[ak)-ako)]^{76) 

2_i 

while the second order perturbation C^j is 


mo - {(fco)iK(te) - . (”) 

The perturbation expansion of the matrix T 
0 12 

T + T + T -\ - (78) 


in orders of ^(fc) — ^(fco) follows from the Gaussian expres¬ 
sion ( |3^ for T in terms of C~^, and the perturbation ex¬ 
pansion dos] ) of C~^. In the real space representation, the 
zeroth order inverse Tijki is 


T{ri,rj,rk,ri) = i SymSD{rik)SDirji)U{ri)U{rj) (79) 

2 (ki) 

0 

while in the Fourier representation Tijki is 
T{ki,kj,kk,ki) = i SymU{ki + kk)U(kj + ki) . (80) 

2 (ki) 


The hrst order correction fijki is 

T{ki,kj,kk,ki) = Sym U{ki + kk)C~^{kj,ki) (81) 

(ij)ikl) 

2 

while the second order correction Tijki is 


2 


T(ki, kj,kk, ki) 


Sym \\c-Aki,kk)C-\kj,ki) 

(ij)(kl) 12 


+ u{ki + kk)c-Akj,ki) 


(82) 


The minimum variance pair window ( |39| ) involves the 
quantities and the Fisher matrix = 

The perturbation expansions of and 
follow straightforwardly from the perturbation expansion of 
above, equations (^^-(|82|), the matrix Df,' being given 
in the Fourier representation by equation (P5|). The zeroth 
order contribution to is 


^ 1 y u{-ki - k^)U{-kj + fe,) do,/(47r) , (83) 

while the hrst and second order perturbations are 
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r'^'“ = Sym U{-ki-kOC-^i-kj,kOdo^/{4Tr) (84) 

(ij) J 


and 

= Sym 
(ij) 


^c-\-ki,-kOc-\-kj,kO 


+ U{—ki — ka)C ^{—kj,ka) doQ:/(47r) . (85) 

For the Fisher matrix r“^, the zeroth order term is 

i7(fea + fe/3)|^dOadO/3/(47r)^ (86) 

while the first and second order terms are 

= - y [/(-fee - ki3)C-Hk^,kO docdo^/(47r)" (87) 


yQ/3 _ }_ 
^ 2 


and 

2 


a0 _ 


^'C-\kc,,k/3)f 


L2 


+ U{-ka-k/3)C ^{ka,kf3) doQdo/3/(47r)^ . (88) 


The time has come to choose the adjustable constant fco 
in the definition (^^ of the window U. Suppose first that it 
is the Fisher matrix by itself which is of interest. This is the 
case for example in Paper 2, where the Fisher matrix is used 
to design sets of kernels for measuring the power spectrum. 
The zeroth order approximation T“^ to the Fisher matrix 
in Fourier space is given by equation (|^. The presumed 
narrowness of the window U0t) about fe = 0 ensures that 
ka ~ k/ 3 . The perturbations T’“^ and T°‘^ can then be made 
small if ko is chosen close to ka and kp. The strategy adopted 
in Paper 2 is to set feo = [ka + kp)l2, and to retain only the 
zeroth order term T“^ of the Fisher matrix. It would also 
be possible to choose ko more crudely, if one were willing to 
retain additional perturbation terms. 

Once a particular kernel or set of kernels has been cho¬ 
sen, perhaps along the lines described in Paper 2, it remains 
to estimate the power spectrum windowed through such ker¬ 
nels. The minimum variance pair window ( ^9| ) for estimating 
the power spectrum windowed through a specified kernel 0°“ 
involves both and the Fisher matrix r“^. If the ker¬ 

nel concerned is narrow about some wavenumber, then it 
would be natural to set fco equal to that wavenumber in ap¬ 
proximating T'i°‘ and r“^. The problem of how to apply an 
approximate pair window is discussed further in ^ 


6.2 Non-Gaussian fluctuations 

The perturbation expansion of the inverse of the sur¬ 

vey covariance can be carried out for non-Gaussian fluctua¬ 
tions, at least in principle, in much the same way as in the 
Gaussian case. ^ 

Suppose that is an approximate inverse of the co- 
variance matrix {ACijACki), where the latter now includes 
the non-Gaussian terms, equation (^. Multiplying the two 
matrices together yields 

{ACAC) T=1 + £ (89) 

where is a supposedly small matrix (note the slightly 
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different font for e here versus e in [Q). The exact inverse 
is then given by the perturbation expansion 

T = {ACAC)~^ =T{l-e + £^ -) (90) 

provided that the series converges. 

Unfortunately, it less clear here what to choose for the 
zero th order approximation, since, as already discussed in 
^.2| , there does not appear to be an elegant non-Gaussian 
generalization of the classical FKP approximation in the 
Gaussian case. Lacking such a generalization, I do not pur¬ 
sue the matter further here. 


7 APPLICATION OF AN APPROXIMATE 
PAIR WINDOW 


The approximations to the matrix and the Fisher ma¬ 
trix suggested in §|q yield approximations to the mini¬ 
mum variance window ( p9[ ) derived in §^. The approximate 
pair window is of course not the same as the true mini¬ 
mum variance pair window. In applying an approximate pair 
window, one should be careful about three points. Firstly, 
whatever pair window is adopted, it should at least give an 
unbiassed estimate of the quantity one wishes to measure - 
that is, the expectation value of the estimator should equal 
the quantity it is desired to measure, even if the variance in 
the estimator is not minimal. Secondly, the estimated un¬ 
certainty in the estimate should be the uncertainty in the 
actual estimator used, not the uncertainty in the minimum 
variance estimator. Thirdly, one should be aware that the 
minimum variance estimator for data in redshift space is not 
the same as the minimum variance estimator for data in real 
(unredshifted) space; the two are related by equation (^8|). 
These issues are discussed below. 

Suppose that the decision has been made to estimate 
the power spectrum f = C^^a, equation (^ or equiva¬ 
lently (|^), windowed through some kernel G“. How to 
choose such kernels is the subject of Paper 2, which illus¬ 
trates several examples, such as in Figures 3, 4, and espe¬ 
cially Figure 5. As discussed in § |2.3| , a pair window W’’^ will 
give an unbiassed estimate equation (^) or (p^), of the win¬ 
dowed power spectrum ^ provided that = G“ (here W°‘, 
equation 0) or (^^, signifies W’’^ integrated over all pairs 
ij at given separation a in the survey, in accordance with 
the convention established in 5^.21). The minimum variance 


pair window derived in is W''^ = T''^°‘T~pC^, equa¬ 
tion (|^). But suppose that in place of the true matrix T, it 
has been decided to use an approximation T. Consider then 
the approximate pair window defined by 

^ (91) 

Contracting equation (§) with D’lj (i.e. integrating over all 
pairs U at given separation a) shows that the pair win¬ 
dow (|9l|) satisfies 


IU“ = G“ 


(92) 


and therefore yields an unbiassed estimate of which is as 
desired. The important point here is that the approximation 
used for in the pair window ( pl|) should be the same as 
the approximation used for that is 

f°‘l^ = . (93) 
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The above point may seem rather obvious - why would 
there be any reason not to use the same approximation for 
both and t°‘^ in the pair window (pi|)? The answer 

lies in the choice of the adjustable constant fco which enters, 
via the window U{r), equation (|^), the approximations pro¬ 
posed in § pT] for the Gaussian case. At the end of it was 
suggested that in evaluating the Fisher matrix T{ka,k/3) it 
would be appropriate to set fco = {ka + k/ 3 )j 2 , but in esti¬ 
mating the power spectrum windowed through some kernel 
G(fc) narrow about some wavenumber, it would be appro¬ 
priate to set fco equal to that wavenumber. According to 
the discussion of the previous paragraph, for an estimate 
of the windowed power spectrum to be unbiassed, it is es¬ 
sential that the same fco be used for both T'^°‘ and T°‘^ in 
the pair window ( 0 ) . In other words, one should not use 
fco = {ka + kji)l2 for in the pair window (0, even if 
the kernel G(fc) was constructed in the first place from a 
Fisher matrix using that approximation. 

It is useful to write out explicitly the relevant equations 
for the simplest and most practical^case, where one uses the 
zeroth order approximation T = T in the perturbation se¬ 
ries of §3.1, for Gaussian fluctuations. Suppose that a kernel 
G(fc) has been chosen which is suitably narrow in Fourier 
space about some wavenumber. For such a kernel, it would 
be appropriate to set fco equal to that wavenumber, a fixed 


constant. Section 3.1 gives the zeroth order expressions for 


and in Fourier space. As long as fco is taken as a 
fixed constant, these expressions can be transformed back 
into real space, yielding 


and 

= \&D{ra-rfi)R{ra) 
where 


R{ra) = / U{ri)U{rj)SD{rij - ra)d\i(frj . 


(94) 

(95) 

(96) 


The quantity R{ra), which is often denoted {RR) in the lit¬ 
erature, is the expected number of weighted random pairs in 
the survey per unit volume interval 47rr^drc« of separation 
about ra- It should be emphasized that the expression ( |95[ ) 
is not a valid general expression for the Fisher matrix; it is 
valid only in that the Fourier transform of this expression 
is a good approximation to the Fisher matrix T{ka,k/3) for 
matrix elements satisfying ka ~ kp ko. The pair weight¬ 
ing which follows from the zeroth order approxima¬ 
tions (H) and (p^ is then 


W{ri,rj) = 


U{ri)U{ri)G{nj) 

R{rij) 


(97) 


where G(r) = f G(fc)e“‘*’ '’d^fc/(27r)® is the kernel G(k) in 
the real space representation. The pair weighting (M) has 
a simple interpretation. The U{ri)U{rj) factor, with U{r) 
given by equation ([t^), is just the FKP pair weighting, while 
the R{rij) in the denominator serves as a normalization fac¬ 
tor which ensures the correct overall weighting of pairs at 
separation rij. If, as is being assumed, the kernel G(fc) is nar¬ 
row in Fourier space, then the kernel G(r) in real space will 
presumably be broad and wiggly. In effect, the pair weight¬ 


ing (0 replaces the factor Uajoikarij)/2 in the original 
FKP pair weighting (p^ by the factor G{rij)/R{rij). 

The second issue mentioned at the beginning of this 
section is that the estimated uncertainty should be the un¬ 
certainty in the actual estimator used, not the uncertainty 
in the minimum variance estimator. In general, the expected 
uncertainty in an estimate equation (0) or (p^), made us¬ 
ing an arbitrary pair window IF*-’ is 

(Af) = tF'^'lF''*(AGi,-AGfei) . (98) 

In the particular case of the zeroth order pair window ( p^ , 
the uncertainty in the corresponding estimator is 


(AC^) = 


2G{ra) 

R{ra) 


■ d Fq 


[ AG{ra)T{ra,r0)G{r,3) 3 3 

- / - p, sp. X -drcdr;3 (99) 

J R{ra)R{ri3) 

1 

where Tirg^rp) on the second line is the first order correc¬ 
tion term ( pT] ) to the Fisher matrix, but expressed in real 
rather than Fourier space. Since this perturbation term is 
small, a satisfactory approximation to the uncertainty (A^^) 
in the estimator should be 

f2G{ra)\z 


(Af) 


R(ra) 


■dFo 


( 100 ) 


The estimate (IOC) of the uncertainty increases monoton- 
ically as the constant g(fco) in the pair window increases, 
since U{r), equation ([t^, hence R{r), equation (|^), de¬ 
creases as i{ko) increases. Thus it might be reasonable to 
allow for the small correction to the uncertainty arising from 
the perturbation term on the second line of equati on (|99| ) by 
computing the uncertainty from the main term ( [lOCy with 
a value of ^(fco) enhanced somewhat over that used in the 
pair window IF*-’ itself. Presumably ^(fc) varies somewhat 
over the extent of the kernel G(fc), providing a natural es¬ 
timate of the amount by which ^(fco) should be enhanced. 
This procedure for estimating the uncertainty in the estima¬ 
tor based on the zeroth order pair window (|^) is used in 
Paper 2, Figure 6. 


The uncertainty (100) in the approximate estimator 
should be somewhat larger, hopefully not by much, than 
the uncertainty T~^G°‘G^ in the minimum variance estima¬ 
tor. Gomputing both uncertainties would give an estimate 
of how much worse the approximate estimator is compared 
to the minimum variance estimator, and would also provide 
a useful numerical check. 

The third issue remarked at the beginning of this section 
is the fact that the minimum variance pair window for data 
in redshift space is not the same as the minimum variance 
pair window for data in real space: the two are related, at 
least for linear redshift distortions, by equation (|4^. How¬ 
ever, equation (^^ is only part of the solution to the prob¬ 
lem of measuring the power spectrum in optimal fashion 
from data in redshift space. A full solution would involve, 
at least, simultaneous measurement of the linear distortion 
parameter fi « /b. I hope to examine this problem fully 

in a subsequent paper. 

In the meantime, it should be remarked that there is 
no harm in applying an approximate pair window, such as 
that given by equation (^^ for example, to data in redshift 
space, as long as one recognizes that such a pair window may 
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not necessarily be near minimum variance. By construction, 
the pair window (^) applied to densities in redshift 

space will give an unbiassed estimate of the angle-averaged 
redshift space power spectrum windowed over the 

kernel G{k), for any choice of window U(r). However, the 
uncertainty in the resulting estimator will not be given by 
equation but rather by 

(Af) = ) (101) 

where ) is the covariance of the covariance of 

the survey in redshift space. This redshift space covariance 
is given in the case of linear redshift distortions by equa¬ 
tion 

8 SUMMARY 

This is the first of two papers which address the problem 
of estimating the unredshifted power spectrum in optimal 
fashion from a galaxy survey. Two principal results have 
been obtained here. 

The first result is a derivation of the minimum vari¬ 
ance pair weighting, equation , for estimating the unred¬ 
shifted power spectrum windowed over some arbitrary pre¬ 
scribed kernel. The minimum variance pair window is valid 
for arbitrary survey geometry, and for Gaussian or non- 
Gaussian fluctuations. The generalization of the minimum 
variance window into redshift space, for linear redshift dis¬ 
tortions, is given in ^ although the problem of measuring 
the linear redshift distortion parameter /3 ~ itself is 

not addressed here. 

The second principal result is a practical, albeit approx¬ 
imate, procedure for evaluating the Fisher information ma¬ 
trix T°‘^. The Fisher matrix is the inverse of the expected 
covariance matrix of minimum variance estimates of power 
spectra, and in effect determines the maximum amount of 
information about the power spectrum which can be ex¬ 
tracted from a survey. The proposed procedure for evaluat¬ 
ing the Fisher matrix is a perturbation expansion, in which 
the zeroth order approximation is essentially the FKP ap¬ 
proximation, valid for Gaussian fluctuations in the classical 
limit where the wavelength is short compared to the scale of 
the survey. This zeroth order approximation to the Fisher 
matrix is used in Paper 2. 
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